Inverse spectral problem for normal 
matrices and a generalization of the 
Gauss-Lucas theorem 

S. M. Malamud 

Abstract. We establish an analog of the Cauchy-Poincaré separation theorem for normal matrices in terms of majorization. Moreover, we present a solution to the inverse spectral problem (a Borg-type result). Using this result we substantially generalize and extend the known Gauss-Lucas theorem about the location of the roots of a complex polynomial and of its derivative. The last result is applied to prove the well-known old conjectures of de Bruijn-Springer and Schoenberg.



1. Introduction 

Let $A = A^*$ be a selfadjoint $n \times n$ matrix and $A_{n-1}$ its principal $(n-1) \times (n-1)$ submatrix, obtained by deleting the last row and column. According to the Cauchy-Poincaré interlacing theorem their spectra $\sigma(A) = \{\lambda_j\}_1^n$ and $\sigma(A_{n-1}) = \{\mu_j\}_1^{n-1}$ separate each other, that is

$$\lambda_1 \le \mu_1 \le \lambda_2 \le \mu_2 \le \dots \le \lambda_{n-1} \le \mu_{n-1} \le \lambda_n. \tag{1.1}$$

It is known (see [16], [23]) that the converse is also true, that is, for any two sequences $\{\lambda_j\}_1^n$ and $\{\mu_j\}_1^{n-1}$ of real numbers satisfying (1.1), there exists a (nonunique) $n \times n$ selfadjoint matrix $A$ such that $\sigma(A) = \{\lambda_j\}_1^n$ and $\sigma(A_{n-1}) = \{\mu_j\}_1^{n-1}$. We say that such a matrix $A = A^*$ solves the inverse spectral problem for these sequences.

The result of Hochstadt, see [15] (an analog of the well-known Borg uniqueness result for the Sturm-Liouville equation), claims that there exists a unique Jacobi (tridiagonal) selfadjoint matrix $A$ solving the inverse problem.

In the present paper we generalize both the Cauchy-Poincaré separation theorem and the Hochstadt theorem to the case of normal matrices.

It is obvious that an analogous result cannot hold literally for a normal matrix $A$: first, because the eigenvalues are not real and, second, because a principal submatrix $A_{n-1}$ is normal only in trivial cases (see [10] and Lemma 4.9).

Our first main result is Theorem 3.7. It provides necessary and sufficient geometric conditions for the sequences $\{\lambda_j\}_1^n$ and $\{\mu_j\}_1^{n-1}$ to be the spectrum of a normal matrix $A$ and of its submatrix $A_{n-1}$ respectively.

In order to formulate these geometric conditions we introduce (in Section 2) several concepts of majorization for sequences of vectors from $\mathbb{R}^n$, which are natural generalizations of the classical ones and coincide with them for $n = 1$.

Note, however, that the sufficient part of the above mentioned conditions can be expressed analytically without majorization and reads as follows:

$$c_k := \frac{\prod_{j=1}^{n-1}(\mu_j - \lambda_k)}{\prod_{1 \le j \le n,\ j \ne k}(\lambda_j - \lambda_k)} \ge 0, \qquad k \in \{1, \dots, n\}. \tag{1.2}$$

The second main topic of our paper is centered on the known Gauss-Lucas theorem [28]. According to this theorem the roots $\{\mu_k\}_1^{n-1}$ of the derivative $p'$ of any complex polynomial $p \in \mathbb{C}[z]$ of degree $n$ lie in the convex hull of the roots $\{\lambda_j\}_1^n$ of the polynomial $p$.

At first glance this topic is rather far from the above one. Nevertheless the following proposition establishes a "bridge" between the two main topics (parts) of the paper.

Proposition 1.1. Let $p(z)$ be a polynomial of degree $n$ with zeros $\{\lambda_j\}_1^n$ and let $\{\mu_j\}_1^{n-1}$ be the zeros of its derivative. Then there exists a (nonunique) normal matrix $A \in M_n(\mathbb{C})$ such that $\sigma(A) = \{\lambda_j\}_1^n$ and $\sigma(A_{n-1}) = \{\mu_j\}_1^{n-1}$.

The proof is immediately implied by (1.2), since now $c_k = 1/n$ for all $k \le n$.
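
As a numerical illustration of the last remark, the following sketch evaluates the quantities $c_k$ from (1.2) for one cubic and compares them with $1/n$. The sample roots 0, 1, i and the helper names are assumptions made for this example only; the derivative roots are obtained from the quadratic formula.

```python
import cmath

def derivative_roots_of_cubic(l1, l2, l3):
    """Roots of p'(z) for the monic cubic p(z) = (z-l1)(z-l2)(z-l3)."""
    e1 = l1 + l2 + l3                      # p(z) = z^3 - e1 z^2 + e2 z - e3
    e2 = l1 * l2 + l1 * l3 + l2 * l3
    # p'(z) = 3 z^2 - 2 e1 z + e2, solved by the quadratic formula
    disc = cmath.sqrt(4 * e1 * e1 - 12 * e2)
    return [(2 * e1 + disc) / 6, (2 * e1 - disc) / 6]

def c_coefficients(lams, mus):
    """c_k = prod_j (mu_j - lam_k) / prod_{j != k} (lam_j - lam_k), as in (1.2)."""
    result = []
    for k, lam_k in enumerate(lams):
        numerator = 1
        for mu in mus:
            numerator *= (mu - lam_k)
        denominator = 1
        for j, lam_j in enumerate(lams):
            if j != k:
                denominator *= (lam_j - lam_k)
        result.append(numerator / denominator)
    return result

lams = [0, 1, 1j]                          # hypothetical sample roots (distinct)
mus = derivative_roots_of_cubic(*lams)
cs = c_coefficients(lams, mus)
print(cs)                                  # each entry should be close to 1/3
```

The check is only a floating-point sanity test for one polynomial; it does not replace the argument via the identity $p'/p = \sum_j 1/(z - \lambda_j)$.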

Combining Proposition 1.1 with Theorem 3.7 (our solution to the inverse problem) and setting $\mu_n := (\sum_1^n \lambda_i)/n$, we immediately arrive (see Proposition 4.2) at the following result: let $\mu := \{\mu_j\}_1^n$, $\lambda := \{\lambda_j\}_1^n$. There exists a doubly stochastic $n \times n$ matrix $S$ such that $\mu = S\lambda$.

This result essentially improves the Gauss-Lucas theorem. Proposi- 
tion ^3 allows us to apply linear algebra techniques to the investigation 
of the location of the zeros of a polynomial and of its derivative. For example, using exterior algebra techniques we obtain a generalization of the Gauss-Lucas theorem for the products of roots (see Section 4).

Note that while the second topic has attracted a lot of attention during the last two decades (see [5], [1], [3], [26], [27], [31]), our approach seems to be new and promising. In particular, within this approach we get simple (and short) solutions to the old problems of de Bruijn-Springer [6] and Schoenberg [33].

Let us briefly describe these problems, fixing the above notations. 

In 1947 de Bruijn and Springer [6] conjectured that the following inequality holds for any convex function $f : \mathbb{C} \to \mathbb{R}$:

$$\frac{1}{n-1} \sum_{j=1}^{n-1} f(\mu_j) \le \frac{1}{n} \sum_{j=1}^{n} f(\lambda_j).$$

They succeeded in proving this inequality for a class of convex functions. We provide a proof of this conjecture by showing that a bistochastic matrix $S$ in the above mentioned representation $\mu = S\lambda$ can be chosen in such a way that all entries in the last row equal $1/n$. Moreover, we prove (Theorem 4.6) that the following inequality is valid for any $k \in \{1, \dots, n-1\}$:

$$\frac{1}{\binom{n-1}{k}} \sum_{1 \le i_1 < \dots < i_k \le n-1} f\Bigl(\prod_{j=1}^{k} (\mu_{i_j} - \alpha)\Bigr) \le \frac{1}{\binom{n}{k}} \sum_{1 \le i_1 < \dots < i_k \le n} f\Bigl(\prod_{j=1}^{k} (\lambda_{i_j} - \alpha)\Bigr). \tag{1.3}$$
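
The inequality (1.3) can be tested numerically on a small case. The sketch below takes a cubic, so $n = 3$ and $k \in \{1, 2\}$, with the convex function $f(z) = |z|$; the sample roots, the shift `alpha` and all function names are illustrative assumptions, not data from the paper.

```python
import cmath
from itertools import combinations
from math import comb, prod

def derivative_roots_of_cubic(l1, l2, l3):
    """Roots of p' for p(z) = (z-l1)(z-l2)(z-l3), via the quadratic formula."""
    e1 = l1 + l2 + l3
    e2 = l1 * l2 + l1 * l3 + l2 * l3
    disc = cmath.sqrt(4 * e1 * e1 - 12 * e2)
    return [(2 * e1 + disc) / 6, (2 * e1 - disc) / 6]

def mean_over_products(points, k, alpha, f):
    """Average of f(prod_j (z_{i_j} - alpha)) over all k-element subsets."""
    values = [f(prod(z - alpha for z in subset))
              for subset in combinations(points, k)]
    return sum(values) / comb(len(points), k)

lams = [0, 1, 1j]                 # hypothetical roots of p
mus = derivative_roots_of_cubic(*lams)
alpha = 0.3 - 0.2j                # arbitrary complex shift

# right-hand side minus left-hand side of (1.3) for f = abs
gaps = {k: mean_over_products(lams, k, alpha, abs)
           - mean_over_products(mus, k, alpha, abs)
        for k in (1, 2)}
print(gaps)                       # both gaps are expected to be nonnegative
```

A nonnegative gap for one polynomial and one convex function is of course only consistent with the statement, not evidence for it in general.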

In 1986 Schoenberg [33] (see also [?]) conjectured that if $\sum_{j=1}^n \lambda_j = 0$, then

$$n \sum_{j=1}^{n-1} |\mu_j|^2 \le (n-2) \sum_{j=1}^{n} |\lambda_j|^2,$$

and equality holds if and only if all numbers $\lambda_j$ lie on the same line. We establish this inequality using Proposition 1.1 (see Theorem 4.10).
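
A quick check of the Schoenberg inequality for $n = 3$ may help fix the normalization. For a cubic with $\sum \lambda_j = 0$ the derivative is $3z^2 + e_2$, so its roots are $\pm\sqrt{-e_2/3}$. The two sample root sets below (one generic, one collinear) and the function name are assumptions chosen for this sketch.

```python
import cmath

def schoenberg_sides(lams):
    """Return (n * sum|mu_j|^2, (n-2) * sum|lam_j|^2) for a centered cubic."""
    l1, l2, l3 = lams
    if abs(l1 + l2 + l3) > 1e-12:
        raise ValueError("the roots must sum to zero")
    e2 = l1 * l2 + l1 * l3 + l2 * l3
    mu = cmath.sqrt(-e2 / 3)          # p'(z) = 3 z^2 + e2
    mus = [mu, -mu]
    n = 3
    lhs = n * sum(abs(m) ** 2 for m in mus)
    rhs = (n - 2) * sum(abs(l) ** 2 for l in lams)
    return lhs, rhs

generic = schoenberg_sides([1, 1j, -1 - 1j])   # roots not on one line
collinear = schoenberg_sides([-1, 0, 1])       # roots on the real line
print(generic, collinear)
```

In the generic sample the inequality is strict, while the collinear sample gives equality up to rounding, in agreement with the equality case stated above.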

Let us briefly sketch the contents of the paper. 

In Section 2 we introduce two new notions of majorization for sequences of vectors with unequal numbers of entries and establish some simple properties of them. We also study the connection between different concepts of majorization and show that (in contrast to the scalar case) they are not equivalent. In particular, this provides a negative answer to the question from the book of Marshall and Olkin [24], p. 433.

In Section 3 we establish an analog of the Cauchy-Poincaré separation theorem for normal matrices (Theorem 3.7). As a corollary, we get an analogous result for "noncommutative" convex combinations of normal matrices (Corollary 3.11). Moreover, we solve an inverse Borg-type problem, generalizing the result of Hochstadt [15].

In Section 4 we substantially generalize and extend the known Gauss-Lucas theorem, based on our solution to the inverse problem for normal matrices. Finally, we apply this result in order to obtain complete solutions to the de Bruijn-Springer and Schoenberg conjectures.

The preliminary version of our paper has already been published as a preprint [20]. The main results of the paper have been announced without proofs in [21].

2. Majorization

2.1. Two definitions of majorization. 

We start with several known definitions (see [14], [23], [24]).

Notation 2.0.1. Let $X$ be a subset of $\mathbb{R}^m$. Denote by $\mathrm{conv}(X)$ the convex hull of $X$, i.e. the smallest convex set containing $X$.

If $X$ is convex, let $\mathrm{Ext}\,X$ denote the (nonempty) set of its extreme points.

Next, $CV(Y)$ stands for the set of all convex functions on a convex set $Y$.

As usual, denote by $C := A \circ B$ the Schur (element-wise) product of two $n \times n$ matrices $A = (a_{ij})$ and $B = (b_{ij})$: $(c_{ij}) := (a_{ij} b_{ij})$.

Definition 2.1. a) A matrix $A \in M_n(\mathbb{R})$ is called bistochastic (doubly stochastic) iff all its entries are nonnegative and the sum of the elements in each row and each column equals one.

We denote the set of bistochastic matrices by $\Omega_n \subset M_n(\mathbb{R})$.

b) A matrix $A \in M_n(\mathbb{R})$ is called unitary-stochastic (orthostochastic) if there exists a unitary (orthogonal) matrix $U \in M_n(\mathbb{C})$ ($U \in M_n(\mathbb{R})$) such that $A = U \circ \overline{U}$, that is, $a_{ij} = |u_{ij}|^2$.

The set of all unitary-stochastic matrices is denoted by $\Omega_n^u$.
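
To make part b) concrete, here is a minimal sketch that builds a unitary-stochastic matrix as the entrywise squared modulus of a $2 \times 2$ unitary matrix and checks the row and column sums. The particular unitary matrix and the helper names are assumptions for illustration.

```python
import math

def hadamard_square(u):
    """Entrywise |u_ij|^2, i.e. the Schur product of U with its conjugate."""
    return [[abs(z) ** 2 for z in row] for row in u]

def is_bistochastic(a, tol=1e-12):
    """Nonnegative entries, every row sum and every column sum equal to one."""
    n = len(a)
    nonnegative = all(x >= -tol for row in a for x in row)
    rows_ok = all(abs(sum(row) - 1) < tol for row in a)
    cols_ok = all(abs(sum(a[i][j] for i in range(n)) - 1) < tol
                  for j in range(n))
    return nonnegative and rows_ok and cols_ok

theta = 0.7
c, s = math.cos(theta), math.sin(theta)
U = [[c, 1j * s],
     [1j * s, c]]          # a unitary matrix: its rows are orthonormal in C^2
S = hadamard_square(U)     # [[cos^2, sin^2], [sin^2, cos^2]]
print(S, is_bistochastic(S))
```

Every matrix obtained this way is bistochastic because the rows and columns of a unitary matrix have unit norm.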

Remark 2.1. The set of all bistochastic matrices is convex and contains all permutation matrices.






The known theorem of Birkhoff states that the set $\mathrm{Ext}(\Omega_n)$ coincides with the set of all permutation matrices, and thus by the Krein-Milman theorem $\Omega_n$ is the convex hull of all permutation matrices.

Note that each unitary-stochastic $n \times n$ matrix is bistochastic, i.e. $\Omega_n^u \subset \Omega_n$. But the converse is not true: not every bistochastic matrix is unitary-stochastic [23], [24].

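Definition 2.2. Let $x = \{x_k\}_{k=1}^l$ and $y = \{y_k\}_{k=1}^m$ be two sequences of vectors in $\mathbb{R}^n$ and $l \le m$. Suppose that the following conditions are fulfilled:

$$\begin{gathered}
\mathrm{conv}(x_{i_1} : 1 \le i_1 \le l) \subset \mathrm{conv}(y_i : 1 \le i \le m), \\
\vdots \\
\mathrm{conv}(x_{i_1} + \dots + x_{i_k} : 1 \le i_1 < \dots < i_k \le l) \subset \mathrm{conv}(y_{i_1} + \dots + y_{i_k} : 1 \le i_1 < \dots < i_k \le m), \\
\vdots \\
x_1 + \dots + x_l \in \mathrm{conv}(y_{i_1} + \dots + y_{i_l} : 1 \le i_1 < \dots < i_l \le m).
\end{gathered} \tag{2.1}$$

Then we say that the sequence $\{y_k\}$ majorizes the sequence $\{x_k\}$ and write $x \prec y$.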

Remark 2.2. If $l = m$, the last condition turns into $x_1 + \dots + x_m = y_1 + \dots + y_m$.

Definition 2.3. Let $x := \{x_k\}_{k=1}^l$ and $y := \{y_k\}_{k=1}^m$, $l \le m$, be two sets of vectors $x_k, y_k \in \mathbb{R}^n$.

We say that $x$ is bistochastically majorized by $y$ and write $x \prec_{ds} y$ if there exist vectors $x_{l+1}, \dots, x_m \in \mathbb{R}^n$ and a bistochastic matrix $S \in \Omega_m$ such that $\tilde{x} := \{x_k\}_{k=1}^m = (I_n \otimes S) y$, that is, $x_k = \sum_{i=1}^m s_{ki} y_i$ for all $k$.

If $S$ can be chosen to be unitary-stochastic, we write $x \prec_{uds} y$.

Next we compare these definitions to the classical ones. For this purpose we recall the notion of majorization ([14], [23], [24]) for sequences of real numbers (the case $n = 1$).

Definition 2.4. Let there be given two real sequences $\alpha := \{\alpha_k\}_1^m$ and $\beta := \{\beta_k\}_1^m$. Let also $\tilde\alpha$ and $\tilde\beta$ be these sequences reordered to be decreasing. If

$$\tilde\beta_1 + \dots + \tilde\beta_j \le \tilde\alpha_1 + \dots + \tilde\alpha_j, \qquad j \in \{1, \dots, m\}, \tag{2.2}$$

then it is usually written $\beta \prec\prec \alpha$. If, moreover,

$$\sum_{k=1}^m \alpha_k = \sum_{k=1}^m \beta_k, \tag{2.3}$$

then the sequence $\beta$ is said to be majorized by $\alpha$, which is denoted by $\beta \prec \alpha$.

The following famous theorem, due to Weyl, Birkhoff and Hardy-Littlewood-Pólya (see [13], [14], [23], [24]), explains the connection between the different definitions of majorization in the scalar case ($n = 1$).

Theorem 2.5. Let $\alpha, \beta \in \mathbb{R}^m$ be two real sequences. Then the following are equivalent:

1) $\beta \prec \alpha$.

2) The following inclusion holds true:

$$\beta \in \mathrm{conv}(\{A\alpha : A \text{ is a permutation matrix}\}) = \mathrm{conv}(\{(\alpha_{i_1}, \dots, \alpha_{i_m}) : (i_1, \dots, i_m) \text{ is a permutation of } \{1, \dots, m\}\}).$$

3) There exists a bistochastic matrix $S \in M_m(\mathbb{R})$ such that $\beta = S\alpha$. In fact, the matrix $S$ can be chosen to be orthostochastic.

4) The inequality

$$\sum_{i=1}^m f(\beta_i) \le \sum_{i=1}^m f(\alpha_i) \tag{2.4}$$

holds for any convex function $f$ on $\mathbb{R}$.

5) The inequality

$$f(\beta_1, \dots, \beta_m) \le f(\alpha_1, \dots, \alpha_m) \tag{2.5}$$

holds for any convex function $f$ on $\mathbb{R}^m$ which is symmetric, that is, invariant under any permutation of the coordinates.
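
The equivalences 1), 3) and 4) can be observed on a small example. The sketch below applies a hand-picked bistochastic matrix to a vector, tests the partial-sum conditions (2.2), (2.3), and compares sums of two convex functions; the matrix, the vector and the function names are assumptions for illustration.

```python
def is_majorized(beta, alpha, tol=1e-12):
    """True if beta is majorized by alpha in the sense of Definition 2.4."""
    b = sorted(beta, reverse=True)
    a = sorted(alpha, reverse=True)
    if len(a) != len(b) or abs(sum(a) - sum(b)) > tol:
        return False
    partial_a = partial_b = 0.0
    for x, y in zip(a, b):
        partial_a += x
        partial_b += y
        if partial_b > partial_a + tol:   # condition (2.2) fails
            return False
    return True

# a bistochastic matrix: every row and every column sums to one
S = [[0.50, 0.50, 0.0],
     [0.25, 0.25, 0.5],
     [0.25, 0.25, 0.5]]
alpha = [4.0, 1.0, 0.0]
beta = [sum(S[i][j] * alpha[j] for j in range(3)) for i in range(3)]

def convex_sum(values, f):
    return sum(f(v) for v in values)

square = lambda t: t * t
dist_to_one = lambda t: abs(t - 1.0)
print(beta, is_majorized(beta, alpha))
```

Here `beta = S alpha` is majorized by `alpha`, and the sums of both convex test functions over `beta` do not exceed those over `alpha`, as item 4) predicts.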

Remark 2.3. Note that in the case $n = 1$ and $l = m$ Definitions 2.2 and 2.3 are equivalent to the above Definition 2.4 of majorization for real sequences. In fact, we have

$$\beta \prec \alpha \iff \beta = S\alpha \text{ for some bistochastic } S \iff (-\beta) \prec (-\alpha)$$

(this fact is very easy to check explicitly). Now, the convex hull of a set of real numbers is the closed interval from the minimal to the maximal number. Thus Definition 2.2 is a natural generalization of the standard one in $\mathbb{R}^1$.

It is very easy to see that the following proposition is valid.

Proposition 2.6. If $x = \{x_k\} \prec_{ds} y = \{y_k\}$, then $x \prec y$.

One might suppose that a complete analog of Theorem 2.5 is valid, that is, that the partial orders $\prec$ and $\prec_{ds}$ are equivalent. But the following example shows that this is not the case.

Example 2.7. Let $n = 2$ and $m = 4$. Set

$$\begin{aligned}
x &= \{x_1, \dots, x_4\} = \{(12, 12), (12, 12), (5, 3), (3, 5)\}, \\
y &= \{y_1, \dots, y_4\} = \{(8, 16), (16, 8), (0, 0), (8, 8)\}.
\end{aligned} \tag{2.6}$$

It is easy to check by hand that $x \prec y$. At the same time it is easy to see that the vector $x_1 = (12, 12)$ can be uniquely expressed as a convex combination of the $y_k$: $x_1 = \frac12 (y_1 + y_2)$. Suppose now that there exists a bistochastic matrix $S$ such that $x = Sy$. Then $S$ has the form

$$S = \begin{pmatrix} 1/2 & 1/2 & 0 & 0 \\ 1/2 & 1/2 & 0 & 0 \\ 0 & 0 & s_{33} & s_{34} \\ 0 & 0 & s_{43} & s_{44} \end{pmatrix},$$

which is impossible, since $x_3, x_4$ do not belong to the convex hull of $y_3, y_4$.
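
The claim $x \prec y$ in Example 2.7 can be probed with the projection criterion (condition 2) of Proposition 2.10): for every direction $h$ the scalar sequence $\{\langle x_i, h\rangle\}$ must be majorized by $\{\langle y_i, h\rangle\}$. The sketch below tests only a finite grid of integer directions, so it is a sanity check under that assumption rather than a proof; the function names and the grid radius are choices made for this example.

```python
def is_majorized(b, a):
    """Scalar majorization b < a for integer sequences of equal length."""
    b, a = sorted(b, reverse=True), sorted(a, reverse=True)
    if len(a) != len(b) or sum(a) != sum(b):
        return False
    partial_b = partial_a = 0
    for vb, va in zip(b, a):
        partial_b += vb
        partial_a += va
        if partial_b > partial_a:
            return False
    return True

def projections_majorized(xs, ys, radius=10):
    """Check <x_i, h> majorized by <y_i, h> for all integer h in a square grid."""
    for p in range(-radius, radius + 1):
        for q in range(-radius, radius + 1):
            if (p, q) == (0, 0):
                continue
            px = [p * u + q * v for u, v in xs]
            py = [p * u + q * v for u, v in ys]
            if not is_majorized(px, py):
                return False
    return True

def on_segment(pt, a, b):
    """True if the integer point pt lies on the closed segment [a, b]."""
    cross = (b[0] - a[0]) * (pt[1] - a[1]) - (b[1] - a[1]) * (pt[0] - a[0])
    if cross != 0:
        return False
    dot = (pt[0] - a[0]) * (b[0] - a[0]) + (pt[1] - a[1]) * (b[1] - a[1])
    return 0 <= dot <= (b[0] - a[0]) ** 2 + (b[1] - a[1]) ** 2

x = [(12, 12), (12, 12), (5, 3), (3, 5)]
y = [(8, 16), (16, 8), (0, 0), (8, 8)]
print(projections_majorized(x, y))                            # grid test of x < y
print(on_segment(x[2], y[2], y[3]), on_segment(x[3], y[2], y[3]))
```

The second line of output reflects the obstruction used in the example: neither $x_3$ nor $x_4$ lies on the segment joining $y_3$ and $y_4$.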

Remark 2.4. On p. 433 of [24] Marshall and Olkin mention the condition 2) of Proposition 2.10 as the weakest possible notion of majorization. By virtue of Proposition 2.10 it is equivalent to $\prec$ for $l = m$. Furthermore they say that the relation between $\prec$ and $\prec_{ds}$ is not clear. Example 2.7 provides a negative answer to this question.

We note the following simple 

Proposition 2.8. a) Let $\{y_k\}_{k=1}^m$ be such that $\mathrm{conv}(\{y_k\}_{k=1}^m)$ is affinely isomorphic to the standard simplex

$$\Sigma_{m-1} := \Bigl\{(t_1, \dots, t_m) \in \mathbb{R}^m : t_k \ge 0, \ \sum_k t_k = 1\Bigr\}.$$

Then $x = \{x_k\} \prec y = \{y_k\}$ if and only if $x \prec_{ds} y$.

b) For $m = 3$ the orders $\prec$ and $\prec_{ds}$ are equivalent.

Proof. a) Under the assumption a) each $x_k$ can be uniquely represented as a convex combination of the $y_i$:

$$x_k = \sum_{i=1}^m s_{ki} y_i, \qquad k \in \{1, \dots, m\},$$

with $\sum_i s_{ki} = 1$. Now, by definition, $x \prec y$ yields

$$\frac1m \sum_k y_k = \frac1m \sum_k x_k = \frac1m \sum_{i=1}^m \Bigl(\sum_k s_{ki}\Bigr) y_i.$$

Defining

$$p_i = \frac1m \sum_{k=1}^m s_{ki},$$

we get $p_i = 1/m$ for all $i$, since the expression via extreme points is unique. Thus the matrix $S = (s_{ki})_{k,i=1}^m$ is the required bistochastic matrix.

b) $m = 3$. If the endpoints of $y_1, y_2, y_3$ in $\mathbb{R}^n$ do not lie on the same line, then a) applies. If they do, then shifting them all by the same vector so that the line passes through the origin, we reduce the problem to the case $n = 1$, contained in Theorem 2.5. □

Remark 2.5. It is clear that if the $\{y_k\}$ are linearly independent, then they satisfy the hypothesis of a).

It is also interesting to note that in the case a) only the first condition $x_k \in \mathrm{conv}(\{y_k\}_{k=1}^m)$ and the last one $\sum x_k = \sum y_k$ are sufficient for the existence of a bistochastic matrix.

2.2. The set of extreme points of the set Maj(y). 

Let $n = 1$ and $x = \{x_k\}_1^m$, $y = \{y_k\}_1^m \in \mathbb{R}^m$. It is known (see [22]) that in this case ($n = 1$) the set of extreme points of the set $\mathrm{Maj}(y) := \{x : x \in \mathbb{R}^m, \ x \prec y\}$ is

$$\mathrm{Ext}(\mathrm{Maj}(y)) = \{Py : P \in \Omega_m, \ P \text{ is a permutation matrix}\}. \tag{2.7}$$

The following statement easily follows from the scalar case. 

Proposition 2.9. Let $x = \{x_k\}_1^m$, $y = \{y_k\}_1^m$ be two sequences of vectors in $\mathbb{R}^n$ and $\mathrm{Maj}(y) := \{x : x \prec y\}$. Then

$$\mathrm{Ext}(\mathrm{Maj}(y)) \supset \{(I_n \otimes P) y : P \text{ is a permutation matrix}\}.$$

The following questions naturally arise in this connection.

Questions: 1) Find some additional geometric conditions such that together with (2.1) they imply $x \prec_{ds} y$.

2) What are the extreme points of the set $\mathrm{Maj}(y)$?

3) Under which conditions on the sequence $y = \{y_k\}$ do the sets $\{x : x \prec y\}$ and $\{x : x \prec_{ds} y\}$ coincide?

Let $CVS(\mathbb{R}^n)$ be the cone in $C(\mathbb{R}^n)$, closed in the topology of pointwise convergence, generated by the set of convex functions

$$\{f(\langle x, y\rangle) : f \in CV(\mathbb{R}), \ y \in \mathbb{R}^n\}. \tag{2.8}$$

The class $CVS(\mathbb{R}^n)$ naturally arises in the following proposition, which is a partial generalization of Theorem 2.5.

Proposition 2.10. Let $x := \{x_j\}_1^l$, $y := \{y_k\}_1^m$ be systems of vectors from $\mathbb{R}^n$. The following conditions are equivalent:

1) $x \prec y$;

2) for any vector $h \in \mathbb{R}^n$

$$(\langle x_1, h\rangle, \dots, \langle x_l, h\rangle) \prec (\langle y_1, h\rangle, \dots, \langle y_m, h\rangle);$$

3) the inequality

$$\sum_{i=1}^l f(x_i) \le \sum_{i=1}^m f(y_i) \tag{2.9}$$

holds true for any nonnegative $f \in CVS(\mathbb{R}^n)$ when $l < m$ and for any $f \in CVS(\mathbb{R}^n)$ if $l = m$.

Proof. 1) $\iff$ 2). A vector lies in a compact convex set $Y \subset \mathbb{R}^n$ iff its projection to any line lies in the projection of $Y$ onto the same line. Thus

$$x_{i_1} + \dots + x_{i_k} \in \mathrm{conv}(\{y_{j_1} + \dots + y_{j_k}\})$$

iff

$$\langle x_{i_1}, h\rangle + \dots + \langle x_{i_k}, h\rangle \in \mathrm{conv}(\{\langle y_{j_1}, h\rangle + \dots + \langle y_{j_k}, h\rangle\})$$

for any $h \in \mathbb{R}^n$. By Remark 2.3 this is equivalent to 2).

2) $\iff$ 3). By a result of Fisher and Holbrook, $\{\langle x_k, h\rangle\}_1^l \prec \{\langle y_k, h\rangle\}_1^m$ if and only if

$$\sum_{k=1}^l f(\langle x_k, h\rangle) \le \sum_{k=1}^m f(\langle y_k, h\rangle)$$

for any nonnegative function $f \in CV(\mathbb{R})$. If $l = m$, we have an equality in (2.9) for any linear function, and any convex function on the line is a sum of a linear function and a nonnegative convex function. This immediately yields the required result. □

We also mention the following beautiful result, which is another partial generalization of the Hardy-Littlewood-Pólya theorem. It is due to Sherman [32] for $l = m$ and to Fisher and Holbrook for $l < m$:

Theorem 2.11. [32], [11] Let $x := \{x_j\}_1^l$ and $y := \{y_k\}_1^m$ be systems of vectors in $\mathbb{R}^n$. Then $x \prec_{ds} y$ if and only if the inequality

$$\sum_{i=1}^l f(x_i) \le \sum_{i=1}^m f(y_i) \tag{2.10}$$

is valid for any nonnegative $f \in CV(\mathbb{R}^n)$ if $l < m$ and for any $f \in CV(\mathbb{R}^n)$ if $l = m$.

The proof is completely different from the scalar case. 

Note that combining Proposition 2.10, Theorem 2.11 and Example 2.7 we arrive at the relation $CVS(\mathbb{R}^n) \ne CV(\mathbb{R}^n)$. Note also that, using our Example 2.7, F. V. Petrov has constructed a simple explicit example of a function $f \in CV(\mathbb{R}^n) \setminus CVS(\mathbb{R}^n)$.

To finish the section, we mention the following elegant result due 
to F. Petrov. 

Proposition 2.12. 1) $\{x_k\}_{k=1}^l \prec \{y_k\}_{k=1}^m$ if and only if the sum of any $s$ of the $x_i$, $1 \le s \le l$, is a linear combination of the $y_j$ with coefficients between 0 and 1 and the sum of coefficients equal to $s$;

2) $\{x_k\}_{k=1}^l \prec \{y_k\}_{k=1}^m$ if and only if there exist vectors $x_{l+1}, \dots, x_m \in \mathbb{R}^n$ such that

$$\{x_k\}_{k=1}^m \prec \{y_k\}_{k=1}^m.$$

Proof. 1) easily follows from Proposition 2.10 (2).

2) It is easy to see from 1) that if $\{x_k\}_{k=1}^l \prec \{z_k\}_{k=1}^m$ with $l < m$, then $\{x_k\}_{k=1}^{l+1} \prec \{z_k\}_{k=1}^m$, where $x_{l+1} = \bigl(\sum_{k=1}^m z_k - \sum_{k=1}^l x_k\bigr)/(m - l)$. □



3. Inverse problem and interlacing theorem for normal matrices

In this section we will use the partial orders $\prec$ and $\prec_{ds}$ for vectors with complex entries. In this case we identify $\mathbb{C}$ with $\mathbb{R}^2$ and so Definition 2.2 does not change.

Let $A^{i_1, \dots, i_k}_{j_1, \dots, j_k}$ denote the submatrix of $A$ with the rows $i_1, \dots, i_k$ and columns $j_1, \dots, j_k$. We denote for brevity $A_{n-1} := A^{1, \dots, n-1}_{1, \dots, n-1}$.

3.1. Preliminary solution to the inverse problem by two 
spectra. 

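Proposition 3.1. Let $\{\lambda_k\}_1^n$ and $\{\mu_j\}_1^{n-1}$ be two sequences of complex numbers. Then the system of inequalities

$$\frac{\prod_{j=1}^{n-1}(\mu_j - \lambda_k)}{\prod_{1 \le j \le n,\ j \ne k}(\lambda_j - \lambda_k)} \ge 0, \qquad k \in \{1, \dots, n\}, \tag{3.1}$$

is valid if and only if there exists a normal matrix $A$ with the spectrum $\sigma(A) = \{\lambda_1, \dots, \lambda_n\}$ such that the spectrum of $\tilde{A} = A_{n-1}$ is $\sigma(\tilde{A}) = \{\mu_1, \dots, \mu_{n-1}\}$.

Proof. Necessity. Let $A$ be a normal matrix with the spectrum $\sigma(A) = \{\lambda_1, \dots, \lambda_n\}$ such that $\sigma(\tilde{A}) = \{\mu_1, \dots, \mu_{n-1}\}$. Let $e = (0, \dots, 0, 1)$. Consider the function

$$\Delta(\lambda) := ((A - \lambda)^{-1} e, e) = \sum_{k=1}^n \frac{|x_k|^2}{\lambda_k - \lambda} = \frac{\prod_{j=1}^{n-1}(\mu_j - \lambda)}{\prod_{j=1}^{n}(\lambda_j - \lambda)}, \tag{3.2}$$

where $x_k$ are the coordinates of $e$ in the orthonormal basis of eigenvectors of $A$. Clearly, the poles of $\Delta(\lambda)$ are in the spectrum of $A$ and the residues at these poles (taken with respect to $(\lambda_k - \lambda)^{-1}$) are equal to $|x_k|^2$ and hence are nonnegative. But by (3.2) these residues equal the numbers (3.1).

Sufficiency. Let (3.1) be fulfilled. Consider the function

$$\Delta(\lambda) := \frac{\prod_{j=1}^{n-1}(\mu_j - \lambda)}{\prod_{j=1}^{n}(\lambda_j - \lambda)}. \tag{3.3}$$

By (3.1), the residues of $\Delta(\lambda)$ at its poles $\lambda_k$ are nonnegative and hence equal $|x_k|^2$ for some real numbers $x_k$. Clearly, we have

$$\Delta(\lambda) = \sum_{k=1}^n \frac{|x_k|^2}{\lambda_k - \lambda}$$

and

$$\sum_{k=1}^n |x_k|^2 = \lim_{\lambda \to \infty} -\lambda \Delta(\lambda) = 1.$$

Therefore, considering the diagonal matrix $A = \mathrm{diag}\{\lambda_1, \dots, \lambda_n\}$ and writing it down in an orthonormal basis with the last vector $e_n = (x_1, \dots, x_n)$, we get the required normal matrix. □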
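
The mechanism of the proof can be watched numerically for $n = 3$. The sketch below builds a normal matrix $A = U D U^*$ from a real orthogonal $U$ (so $U^* = U^T$), computes the eigenvalues of the leading $2 \times 2$ block by the quadratic formula, and compares the quotients (3.1) with the squared entries of the last row of $U$. The eigenvalues, rotation angles and helper names are assumptions made for this example.

```python
import cmath
import math

def matmul(a, b):
    """Product of two matrices given as lists of rows."""
    return [[sum(a[i][k] * b[k][j] for k in range(len(b)))
             for j in range(len(b[0]))] for i in range(len(a))]

def plane_rotation(p, q, theta, n=3):
    """Real rotation by theta in the coordinate plane (p, q) of R^n."""
    g = [[1.0 if i == j else 0.0 for j in range(n)] for i in range(n)]
    c, s = math.cos(theta), math.sin(theta)
    g[p][p], g[p][q], g[q][p], g[q][q] = c, -s, s, c
    return g

lams = [0, 1, 1j]                                   # hypothetical spectrum
U = matmul(plane_rotation(0, 2, 0.6), plane_rotation(1, 2, 1.1))
D = [[lams[i] if i == j else 0 for j in range(3)] for i in range(3)]
Ut = [list(col) for col in zip(*U)]                 # U is real, so U* = U^T
A = matmul(matmul(U, D), Ut)                        # normal by construction

# eigenvalues mu_1, mu_2 of the principal submatrix A_{n-1} (here 2 x 2)
tr = A[0][0] + A[1][1]
det = A[0][0] * A[1][1] - A[0][1] * A[1][0]
disc = cmath.sqrt(tr * tr - 4 * det)
mus = [(tr + disc) / 2, (tr - disc) / 2]

def quotient(k):
    """Left-hand side of (3.1) for the index k."""
    num = 1
    for mu in mus:
        num *= (mu - lams[k])
    den = 1
    for j in range(3):
        if j != k:
            den *= (lams[j] - lams[k])
    return num / den

cs = [quotient(k) for k in range(3)]
weights = [U[2][k] ** 2 for k in range(3)]          # |x_k|^2 from the last row of U
print(cs, weights)
```

Up to rounding, the quotients are real, nonnegative, sum to one and agree with the weights $|x_k|^2$, which is exactly the content of (3.2).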

Remark 3.1. Note that the poles of the function $\Delta(\lambda)$ (see (3.2)) are simple (i.e. of multiplicity one), since the matrix $A$ is (unitarily) diagonalizable. Therefore it easily follows from (3.2), as well as from general dimension arguments, that if $A$ has a $k$-multiple eigenvalue $\lambda_0$, then $\lambda_0$ is an eigenvalue of $A_{n-1}$ of multiplicity at least $k - 1$. Note, however, that $A_{n-1}$ is normal only in very special cases (see Lemma 4.9). Moreover, it may even happen that $A_{n-1}$ is not of simple structure, that is, it may be nondiagonalizable.

Corollary 3.2. Let two systems of complex numbers $\{\mu_j\}_1^{n-1}$ and $\{\lambda_j\}_1^n$ satisfy

$$\Delta(\lambda) := \frac{\prod_{j=1}^{n-1}(\mu_j - \lambda)}{\prod_{j=1}^{n}(\lambda_j - \lambda)} = \sum_{j=1}^n \frac{|x_j|^2}{\lambda_j - \lambda}$$

with some complex numbers $\{x_j\}_1^n$. Then for any unitary matrix $U = (u_{ij})_{i,j=1}^n$ with the last row $(u_{n1}, \dots, u_{nn}) = (x_1, \dots, x_n)$, the matrix $A := U \,\mathrm{diag}\{\lambda_j\}_1^n\, U^*$ is the normal matrix required in Proposition 3.1.

3.2. Quasi-Jacobi normal matrices and an analog of the 
Hochstadt theorem. 

It is known and easy to see that any selfadjoint matrix (bounded operator) is unitarily equivalent to a selfadjoint tridiagonal (Jacobi) matrix.

Here we find an analog of such a form for a normal matrix and apply it in order to obtain an analog of the Hochstadt result [15] on the unique recovery of a Jacobi matrix from two spectra.

Proposition 3.3. Every normal $m \times m$ matrix $A$ is unitarily equivalent to a direct sum of normal matrices $A_i$, $i = 1, \dots, k$, satisfying $(A_i)_{j,k} = 0$ for $k \ge j + 2$ and $(A_i)_{j,j+1} \ne 0$. Moreover, $A$ has simple spectrum iff it is unitarily equivalent to only one such matrix.

Proof. The proof is very simple and standard. It is clear that it suffices to consider only the case of simple spectrum.

In this case, taking any cyclic vector $x$, we get that $\{A^j x\}_{j=0}^{m-1}$ forms a basis in $\mathbb{C}^m$. After the Gram-Schmidt procedure we arrive at the required basis. □

Definition 3.4. A matrix $A$ is called quasi-Jacobi if it satisfies the hypothesis of Proposition 3.3.

Thus, the quasi-Jacobi form is in a sense a normal form for a normal matrix. Now we can complement Proposition 3.1 with a uniqueness result.

Theorem 3.5. For any two systems of complex numbers $\{\lambda_j\}_1^n$ and $\{\mu_j\}_1^{n-1}$ satisfying (3.1) there exists a unique normal quasi-Jacobi matrix $A$ such that $\sigma(A) = \{\lambda_j\}_1^n$ and $\sigma(A_{n-1}) = \{\mu_j\}_1^{n-1}$.

Proof. Writing the function $\Delta(\lambda)$ from (3.3) in the form

$$\Delta(\lambda) = \int \frac{d\mu(z)}{z - \lambda}$$

with $d\mu = \sum_{k=1}^n |x_k|^2 \delta_{\lambda_k}$, we can introduce the orthogonal polynomials with respect to the measure $\mu$ just like in the Jacobi case (see, e.g., [?] and Gesztesy and Simon [12]) and then, following the same lines as in [12], we get the result. □

Remark 3.2. The function $\Delta(\lambda)$ is an analog of the Weyl M-function in this case (see [12], [17]). Note that our proof of Proposition 3.1 is similar to that proposed in [12], [17].

3.3. The set of all possible diagonals in the "unitary" orbit of a normal matrix.

The criterion (3.1) of Proposition 3.1 is trivial and provides no information on the geometry of the sequences $\{\lambda_k\}_1^n$ and $\{\mu_j\}_1^{n-1}$. It is even unclear how far $\mu_j$ can lie from $\lambda_k$. Therefore it would be desirable to have a more "geometric" answer, being an analog of the Poincaré theorem.

In this subsection we start with an arbitrary normal matrix $A$ and give a (rather trivial) description of the set of diagonals of its "unitary" orbit $\{UAU^* : U \in M_n(\mathbb{C}), \ U^*U = I\}$. In the next subsection we apply this result to a complete solution of the inverse spectral problem for a normal matrix.

Proposition 3.6. Let $A \in M_n(\mathbb{C})$ be a normal matrix with the spectrum $(\lambda_1, \dots, \lambda_n)$. Then

a) $(a_{11}, \dots, a_{nn}) \prec_{uds} (\lambda_1, \dots, \lambda_n)$;

b) there exists an orthonormal basis $\{e_i\}_1^n$ such that $(Ae_i, e_i) = a_i$ iff there exists a unitary-stochastic matrix $O$ such that $\mathrm{col}(a_1, \dots, a_n) = O\,\mathrm{col}(\lambda_1, \dots, \lambda_n)$;

c) if $A$ is selfadjoint, then the set of all possible diagonals (in all orthonormal bases) is convex;

d) the set of all possible diagonals (in all orthonormal bases) of a fixed normal matrix $A$ is not necessarily convex.

Proof. The validity of a) and b) is obvious.

c) is due to Horn [16].

d) It is clear that all permutations of the set $(\lambda_1, \dots, \lambda_n)$ are realized by diagonals. Thus, if the set of diagonals were convex, it would contain all vectors of the form $S\,\mathrm{col}(\lambda_1, \dots, \lambda_n)$ with a bistochastic $S$. Take

$$S = \frac12 \begin{pmatrix} 1 & 1 & 0 \\ 1 & 0 & 1 \\ 0 & 1 & 1 \end{pmatrix}.$$

It is known (see [23]) that $S$ is not unitary-stochastic. Set $(\lambda_1, \lambda_2, \lambda_3) = (1, i, 0)$ and

$$S(\lambda_1, \lambda_2, \lambda_3)^t = (a_1, a_2, a_3)^t = \tfrac12 (1 + i, 1, i)^t.$$

If there exists a unitary-stochastic matrix $O$ such that

$$O(\lambda_1, \lambda_2, \lambda_3)^t = (a_1, a_2, a_3)^t,$$

then, since the points $1, i, 0$ are affinely independent, it immediately yields $O = S$. □

Remark 3.3. The result of c) is due to Horn (see [25], [23], [11], [10]). All other statements are folklore.
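
The counterexample in part d) rests on a bistochastic matrix that is not unitary-stochastic. The sketch below assumes the classical matrix with rows $(1/2, 1/2, 0)$, $(1/2, 0, 1/2)$, $(0, 1/2, 1/2)$, which is the only bistochastic matrix sending $(1, i, 0)$ to a multiple of $(1 + i, 1, i)$, and tests a necessary condition: if $|u_{ik}|^2 = s_{ik}$ for a unitary $U$, the moduli $|u_{ik}||u_{jk}|$ of two rows must be able to cancel, so the largest cannot exceed the sum of the others. The function name is hypothetical.

```python
import math

# assumed form of the matrix S from part d)
S = [[0.5, 0.5, 0.0],
     [0.5, 0.0, 0.5],
     [0.0, 0.5, 0.5]]

def rows_can_be_orthogonal(s, i, j, tol=1e-12):
    """Necessary condition for rows i, j of a unitary U with |u_ik|^2 = s_ik.

    The products u_ik * conj(u_jk) have moduli sqrt(s_ik * s_jk) and must sum
    to zero, so the largest modulus is at most the sum of the remaining ones.
    """
    terms = [math.sqrt(s[i][k] * s[j][k]) for k in range(len(s))]
    largest = max(terms)
    return largest <= sum(terms) - largest + tol

lams = [1, 1j, 0]
image = [sum(S[r][k] * lams[k] for k in range(3)) for r in range(3)]
print(image)                               # S applied to (1, i, 0)
print(rows_can_be_orthogonal(S, 0, 1))     # the condition fails for rows 1 and 2
```

Rows 1 and 2 share exactly one nonzero column, so the single product of moduli cannot be cancelled and no unitary matrix has this pattern of squared moduli.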

3.4. Analog of the Cauchy-Poincaré interlacing theorem and a solution to the inverse spectral problem for normal matrices.

Now we are ready to state the main result of the section. Define for any vector $\{\lambda_j\}_1^m \in \mathbb{C}^m$ the vector

$$C_k(\{\lambda_j\}_1^m) := \{\lambda_{i_1} \cdots \lambda_{i_k}\}_{1 \le i_1 < i_2 < \dots < i_k \le m} \in \mathbb{C}^{\binom{m}{k}}. \tag{3.6}$$

Theorem 3.7. Let $\{\lambda_1, \dots, \lambda_n\}$ and $\{\mu_1, \dots, \mu_{n-1}\}$ be two systems of complex numbers. Then for the existence of a normal matrix $A$ such that $\sigma(A) = \{\lambda_1, \dots, \lambda_n\}$ and $\sigma(A_{n-1}) = \{\mu_1, \dots, \mu_{n-1}\}$ it is necessary that the condition

$$C_k(\{\mu_j - \alpha\}_1^{n-1}) \prec_{uds} C_k(\{\lambda_j - \alpha\}_1^n) \tag{3.7}$$

be fulfilled for any complex number $\alpha \in \mathbb{C}$ and any $k \in \{1, \dots, n-1\}$, and sufficient that it be fulfilled for $k = n - 1$ and all $\alpha \in \{\lambda_1, \dots, \lambda_n\}$.

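Proof. a) Sufficiency. Let $\alpha = \lambda_k$. Then (3.7) for $k = n - 1$ implies

$$\prod_{j=1}^{n-1}(\mu_j - \lambda_k) \in \mathrm{conv}\Bigl(\Bigl\{\prod_{1 \le j \le n,\ j \ne l}(\lambda_j - \lambda_k) : l = 1, \dots, n\Bigr\}\Bigr) = \mathrm{conv}\Bigl(0; \prod_{j \ne k}(\lambda_j - \lambda_k)\Bigr). \tag{3.8}$$

Hence (3.1) is valid for all $k$. Proposition 3.1 yields the required result.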

b) Necessity. Let $A$ be a normal matrix with the spectrum $\sigma(A) = \{\lambda_1, \dots, \lambda_n\}$ and $\sigma(A_{n-1}) = \{\mu_1, \dots, \mu_{n-1}\}$. Let us prove (3.7). By the Schur theorem [23], [16] there exists a unitary matrix $V_1 \in M_{n-1}(\mathbb{C})$ such that the matrix $V_1^* A_{n-1} V_1$ is upper triangular. Therefore, considering the matrix $U_1 := V_1 \oplus 1 \in M_n(\mathbb{C})$, we get the normal matrix $B := U_1^* A U_1$ with the same spectrum as $A$, $\sigma(B) = \sigma(A)$, but with the $\mu_j$ on the diagonal. Therefore we can take $B$ instead of $A$. Proposition 3.6 implies $\{\mu_j\}_1^{n-1} \prec_{uds} \{\lambda_j\}_1^n$.

Take an arbitrary $\alpha \in \mathbb{C}$. The matrix $B - \alpha I$ is also normal. Let us consider its exterior power $C_k(B - \alpha I) := \wedge^k (B - \alpha I)$ acting on the space $\wedge^k H := H \wedge \dots \wedge H$ ($k$ times). Then this matrix is also normal with the spectrum

$$\sigma(C_k(B - \alpha I)) = \{(\lambda_{i_1} - \alpha) \cdots (\lambda_{i_k} - \alpha)\}_{1 \le i_1 < \dots < i_k \le n}.$$

The diagonal elements of $C_k(B - \alpha I)$ are the $k \times k$ principal minors of $B - \alpha I$. Since $B_{n-1} - \alpha I_{n-1}$ is upper triangular, the $k \times k$ principal minors of $B_{n-1} - \alpha I$ equal

$$(\mu_{i_1} - \alpha) \cdots (\mu_{i_k} - \alpha), \qquad 1 \le i_1 < \dots < i_k \le n - 1.$$

Thus Proposition 3.6 a) implies the required result. □

Remark 3.4. It is easy to see from the proof that the unitary-stochastic matrices in (3.7) are independent of $\alpha$.

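Corollary 3.8. There exists a normal matrix $A$ such that $\sigma(A) = \{\lambda_1, \dots, \lambda_n\}$ and $\sigma(A_{n-1}) = \{\mu_1, \dots, \mu_{n-1}\}$ if and only if

$$\prod_{k=1}^{n-1}(\lambda_j - \mu_k) \in \mathrm{conv}(0; p'(\lambda_j)) \quad \text{for all } j \in \{1, \dots, n\},$$

where $p(\lambda) = \prod_{k=1}^n (\lambda - \lambda_k)$.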

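Example 3.9. In the case $n = 3$ the orders $\prec$ and $\prec_{ds}$ are equivalent. Therefore in this case conditions (3.7) take an especially simple form:

$$\begin{gathered}
\mu_1, \mu_2 \in \mathrm{conv}\{\lambda_j\}_1^3, \qquad \mu_1 + \mu_2 \in \mathrm{conv}\{\lambda_1 + \lambda_2, \ \lambda_2 + \lambda_3, \ \lambda_1 + \lambda_3\}, \\
\mu_1 \mu_2 - \alpha(\mu_1 + \mu_2) \in \mathrm{conv}\{\lambda_k \lambda_p - \alpha(\lambda_k + \lambda_p)\}_{1 \le k < p \le 3}, \qquad \alpha \in \mathbb{C}.
\end{gathered}$$

An immediate consequence is: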

Corollary 3.10. Let $A$ be a normal matrix with the spectrum $\{\lambda_1, \dots, \lambda_n\}$ and $P$ an $m$-dimensional ($m < n$) orthoprojection in $\mathbb{C}^n$. Let also $B := PA|_{P\mathbb{C}^n}$ and $\sigma(B) := (\mu_1, \dots, \mu_m)$. Then

$$C_k(\{\mu_j - \alpha\}_1^m) \prec_{uds} C_k(\{\lambda_j - \alpha\}_1^n) \tag{3.9}$$

for all $k$, $1 \le k \le m$.

Corollary 3.11. Let $A_i \in M_{n_i}(\mathbb{C})$, $i = 1, \dots, p$, be a $p$-tuple of normal matrices, and $\sigma(A_i) = \{\lambda^i_j\}_{j=1}^{n_i}$ their spectra. Let $S_i$ be $n_i \times m$ matrices, $i = 1, \dots, p$, such that

$$\sum_{i=1}^p S_i^* S_i = I_m \tag{3.10}$$

is the identity operator in $\mathbb{C}^m$. Consider

$$B = \sum_{i=1}^p S_i^* A_i S_i \tag{3.11}$$

and let $\sigma(B) = (\mu_1, \dots, \mu_m)$. Set $n = n_1 + \dots + n_p$ and

$$(\lambda_1, \dots, \lambda_n) = (\lambda^1_1, \dots, \lambda^1_{n_1}, \lambda^2_1, \dots, \lambda^2_{n_2}, \dots, \lambda^p_1, \dots, \lambda^p_{n_p}). \tag{3.12}$$

Then the systems of numbers $(\lambda_1, \dots, \lambda_n)$ and $(\mu_1, \dots, \mu_m)$ satisfy conditions (3.9).

Proof. Consider the normal matrix $A := \oplus_{i=1}^p A_i$. Condition (3.10) means that the operator

$$V := \begin{pmatrix} S_1 \\ \vdots \\ S_p \end{pmatrix} : \mathbb{C}^m \to \mathbb{C}^n$$

is an isometry and hence the operator

$$B = V^* A V$$

is unitarily equivalent to $PA|_{P\mathbb{C}^n}$, where $P$ is the projector onto the image of $V$. Corollary 3.10 completes the proof. □

Remark 3.5. There is another way to implement the trick of going from Corollary 3.10 to Corollary 3.11. See [?].



4. Location of roots of a polynomial and of its derivative 

4.1. Generalization of the Gauss-Lucas Theorem. 

Recall the known Gauss-Lucas theorem 

Theorem 4.1. The roots of the derivative p' of a polynomial p G 
C[z] lie in the convex hull of the roots of p. 

Numerous papers are devoted to different generalizations and improvements of this result (see e.g. [5], [2], [31]).

In what follows we denote by $p \in \mathbb{C}[z]$ a polynomial of degree $n$ with complex coefficients. Let also $\{\lambda_j\}_1^n$ be the roots of $p$ and $\{\mu_j\}_1^{n-1}$ the roots of its derivative $p'$. We set additionally $\mu_n := (\sum_1^n \lambda_j)/n$. Then the Gauss-Lucas theorem reads as follows: there exists a stochastic (by rows) matrix $S'$ such that $\mu = S'\lambda$, where $\mu := \{\mu_j\}_1^n$ and $\lambda := \{\lambda_j\}_1^n$.

The following result, being a corollary of Theorem 3.7, improves the Gauss-Lucas theorem.

Proposition 4.2. Let $p \in \mathbb{C}[z]$ be a degree $n$ complex polynomial with roots $\{\lambda_j\}_1^n$ and let $\{\mu_k\}_1^{n-1}$ be the roots of its derivative $p'$. Then the sequences $\{\lambda_j\}_1^n$ and $\{\mu_k\}_1^{n-1}$ satisfy (3.7) for $k \in \{1, \dots, n-1\}$. In particular, there exists a matrix $S \in \Omega_n$ such that $\mu = S\lambda$, that is, the vector $\mu$ is bistochastically majorized by the vector $\lambda$.

Proof. It follows from Corollary 3.2 and the obvious identity

$$\frac{p'(z)}{p(z)} = \sum_{j=1}^n \frac{1}{z - \lambda_j} \tag{4.1}$$

that there exists a normal matrix $A$ such that $\sigma(A) = \{\lambda_j\}_1^n$ and $\sigma(A_{n-1}) = \{\mu_j\}_1^{n-1}$. One completes the proof by applying Theorem 3.7. □

Note that this corollary does not give complete information on the location. In particular, it does not explain the identities

$$\frac{1}{\binom{n-1}{k}} \sum_{1 \le i_1 < \dots < i_k \le n-1} \ \prod_{j=1}^k \mu_{i_j} = \frac{1}{\binom{n}{k}} \sum_{1 \le i_1 < \dots < i_k \le n} \ \prod_{j=1}^k \lambda_{i_j}, \tag{4.2}$$

which mean that even the products of the roots are equally distributed.
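
The identities for averaged products of roots follow from Vieta's formulas applied to $p$ and $p'$, and this can be checked without any root finding. In the sketch below the elementary symmetric functions of the roots of $p'$ are read off from the coefficients of $p'$; the optional shift `alpha` corresponds to replacing $p(z)$ by $p(z + \alpha)$. The sample roots and function names are assumptions for illustration.

```python
from itertools import combinations
from math import comb, prod

def monic_coefficients(roots):
    """Coefficients [1, a_1, ..., a_n] of prod (z - r), highest degree first."""
    coeffs = [1]
    for r in roots:
        # multiply the current polynomial by (z - r)
        coeffs = [a - r * b for a, b in zip(coeffs + [0], [0] + coeffs)]
    return coeffs

def elementary_symmetric(values, k):
    """e_k(values): sum of all products of k distinct entries."""
    return sum(prod(subset) for subset in combinations(values, k))

def elementary_symmetric_of_derivative_roots(roots, k):
    """e_k of the roots of p', obtained from the coefficients of p' (Vieta)."""
    n = len(roots)
    coeffs = monic_coefficients(roots)
    dcoeffs = [(n - i) * c for i, c in enumerate(coeffs[:-1])]   # p'
    return (-1) ** k * dcoeffs[k] / dcoeffs[0]

lams = [0, 1, 1j, -2 + 0.5j]          # hypothetical roots of a quartic
alpha = 0.3 - 0.2j
shifted = [l - alpha for l in lams]    # roots of p(z + alpha)
n = len(lams)

differences = []
for k in range(1, n):
    left = elementary_symmetric_of_derivative_roots(shifted, k) / comb(n - 1, k)
    right = elementary_symmetric(shifted, k) / comb(n, k)
    differences.append(abs(left - right))
print(differences)                     # all entries should be at rounding level
```

The agreement reflects the elementary relation $e_k(\mu) = \frac{n-k}{n} e_k(\lambda)$ together with $\binom{n-1}{k} / \binom{n}{k} = (n-k)/n$.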

It turns out that in this case we can obtain more complete information on the matrices $S \in \Omega_n^u$ realizing the majorization.

Theorem 4.3. Let $k \in \{1, \dots, n-1\}$, $a := \binom{n-1}{k}$, $b := \binom{n}{k}$. Then there exists a matrix $S_k = (s_{ijk}) \in \Omega_b^u$ such that $\sum_{i=a+1}^b s_{ijk} = k/n$ for all $j \in \{1, \dots, b\}$ and $C_k(\{\mu_j - \alpha\}_1^{n-1}) = P S_k C_k(\{\lambda_j - \alpha\}_1^n)$ (see (3.6)), where $P : \mathbb{C}^b \to \mathbb{C}^a$ is the natural orthoprojection.

In particular, if $k = 1$ then there exists $S_1 \in \Omega_n^u$ such that $s_{nj1} = 1/n$, $j \in \{1, \dots, n\}$, and $\{\mu_j\}_1^n = S_1 \{\lambda_j\}_1^n$.

PROOF. Set D : = diag(A 1; . . . , A n ). By Corollary l3~21 and (|4~Tj) we 

have, that for any unitary matrix V with the last row consisting of 
1/y/n, the normal matrix A := VDV* solves the inverse problem for 
a pair of sequences {A^}™ and {fJ>j}i~ , that is a (A) = {A-,}" and 
cr(v4 n _i) = {/^j}^ 1 - Let, further U\ be the same unitary matrix as in 
the proof of Theorem 13.71 Then U := t/jV(G M n (C)) is a unitary ma- 
trix with the last row consisting of 1/ \fn. Moreover, it follows from the 
proof of Theorem O that {^}™ = S^Xj}^, where S ± := U o U(e fi") 
is the unitary stochastic matrix with the last row consisting of 1/n. 

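Therefore, passing to the exterior powers as in the proof of Theorem 3.7, we get that the unitary stochastic matrix

$$S_k := C_k(U) \circ \overline{C_k(U)}, \qquad k \in \{1, \dots, n\},$$

realizes the unitary stochastic majorization of the systems of numbers $C_k(\{\mu_j\}_1^{n-1})$ and $C_k(\{\lambda_j\}_1^{n})$. We prove that $S_k$ has the additional property

$$\sum_{1 \le i_1 < \dots < i_k \le n-1} \left| U^{1,\dots,k}_{i_1,\dots,i_k} \right|^2 = \frac{n-k}{n}. \tag{4.3}$$

Because of the symmetry, the same identity is certainly valid for all other choices of $k$ columns. Since every column of the unitary matrix $C_k(U)$ has unit norm, (4.3) is equivalent to the statement that the sum of $\left| U^{1,\dots,k}_{i_1,\dots,i_{k-1},n} \right|^2$ over all $1 \le i_1 < \dots < i_{k-1} \le n-1$ equals $k/n$.

Expanding each of the latter minors with respect to the last row we get

$$U^{1,\dots,k}_{i_1,\dots,i_{k-1},n} = \frac{1}{\sqrt{n}} \sum_{l=1}^{k} (-1)^{k+l}\, U^{1,\dots,\hat{l},\dots,k}_{i_1,\dots,i_{k-1}}, \tag{4.4}$$

where $\hat{l}$ means that the column index $l$ is omitted, and thus

$$\left| U^{1,\dots,k}_{i_1,\dots,i_{k-1},n} \right|^2 = \frac{1}{n} \sum_{p,q=1}^{k} (-1)^{p+q}\, U^{1,\dots,\hat{p},\dots,k}_{i_1,\dots,i_{k-1}}\, \overline{U^{1,\dots,\hat{q},\dots,k}_{i_1,\dots,i_{k-1}}}. \tag{4.5}$$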

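Consider now the matrix $V := U^{1,\dots,k}_{1,\dots,n-1}$ formed by the first $n-1$ rows and the first $k$ columns of $U$. Then, since $U$ is unitary, we get

$$T := V^* V = (t_{ij})_{i,j=1}^{k} \quad \text{with} \quad t_{ij} = \delta_{ij} - \frac{1}{n}.$$

Therefore we get

$$C_{k-1}(V)^* \cdot C_{k-1}(V) = C_{k-1}(T).$$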



Because of (4.5), one easily sees that the required sum in (4.3) is a linear combination of the elements of the matrix $C_{k-1}(V)^* \cdot C_{k-1}(V)$. Thus this sum depends only on the matrix $T$, is independent of the choice of $U$ and is the same for all other choices of $k$ columns.

But by (4.2), these sums can be nothing but $(n-k)/n$. The proof is complete. □

Corollary 4.4. Let $U$ be an $n \times n$ unitary matrix with $u_{nj} = 1/\sqrt{n}$. Let $C_k(U)$ be its $k$-th exterior power. Then for $k < n$, $S_k := C_k(U) \circ \overline{C_k(U)}$ is a unitary stochastic matrix satisfying

$$\sum_{i=1}^{\binom{n-1}{k}} s_{ij} = \frac{n-k}{n}$$

for all $j \in \{1, \dots, \binom{n}{k}\}$.
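The column-sum property of $S_k$ can be probed numerically. The following is only a sketch under stated assumptions: the rows of $C_k(U)$ are taken to be ordered so that the $k$-subsets of $\{1,\dots,n-1\}$ come first, the test matrix is the Fourier-type unitary matrix (chosen here because its last row is constant), and all function names are hypothetical. By the Cauchy-Binet formula each sum should equal $\det(I_k - \tfrac{1}{n}J) = (n-k)/n$, where $J$ is the all-ones matrix.

```python
import cmath
from itertools import combinations, permutations

def det(M):
    """Leibniz-formula determinant; adequate for the tiny minors used here."""
    n = len(M)
    total = 0
    for perm in permutations(range(n)):
        # Sign of the permutation via its number of inversions.
        sign = 1
        for i in range(n):
            for j in range(i + 1, n):
                if perm[i] > perm[j]:
                    sign = -sign
        term = sign
        for i in range(n):
            term *= M[i][perm[i]]
        total += term
    return total

def fourier_unitary(n):
    """Assumed test case: unitary matrix whose last row is constant 1/sqrt(n)."""
    eps = cmath.exp(2j * cmath.pi / n)
    return [[eps ** (k * j) / n ** 0.5 for j in range(n)] for k in range(1, n + 1)]

def top_block_column_sums(U, k):
    """For each choice of k columns, sum |minor|^2 over row sets avoiding the last row."""
    n = len(U)
    sums = []
    for cols in combinations(range(n), k):
        s = 0.0
        for rows in combinations(range(n - 1), k):
            minor = det([[U[r][c] for c in cols] for r in rows])
            s += abs(minor) ** 2
        sums.append(s)
    return sums
```

For instance, `top_block_column_sums(fourier_unitary(4), 2)` returns six numbers, one per pair of columns, which can be compared with $(4-2)/4$.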

4.2. Conjecture of de Bruijn-Springer and its generalization.

In 1947 de Bruijn and Springer |Hj conjectured that the inequality 



$$\frac{1}{n-1} \sum_{j=1}^{n-1} f(\mu_j) \le \frac{1}{n} \sum_{j=1}^{n} f(\lambda_j) \tag{4.6}$$

holds for any convex function $f : \mathbb{C} \to \mathbb{R}$.

In order to prove this conjecture as well as its generalization we need the following simple lemma.

Lemma 4.5. Let $\{x_j\}_1^{k}$, $\{y_j\}_1^{n}$ be two sequences of vectors from $\mathbb{R}^m$. Suppose that there exists a matrix $S = (s_{ij}) \in \mathbb{R}^{k \times n}$ with non-negative entries and such that $\{x_j\}_1^{k} = (S \otimes I_m)\{y_j\}_1^{n}$ and

$$\sum_{j=1}^{n} s_{ij} = 1, \quad i \in \{1, \dots, k\}, \qquad \text{and} \qquad \sum_{i=1}^{k} s_{ij} = \frac{k}{n}, \quad j \in \{1, \dots, n\}. \tag{4.7}$$

Then the inequality

$$\frac{1}{k} \sum_{j=1}^{k} f(x_j) \le \frac{1}{n} \sum_{i=1}^{n} f(y_i) \tag{4.8}$$

holds true for any function $f \in CV(\mathbb{R}^m)$.

Proof. One obtains the proof by combining the Jensen inequality with relations (4.7). □
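The Jensen argument behind Lemma 4.5 can be illustrated with a small numerical sketch. The matrix, the points and the function names below are assumptions chosen for illustration (complex numbers play the role of vectors in $\mathbb{R}^2$); the code checks the row and column sum conditions and then evaluates the difference between the right-hand and left-hand averages, which Jensen's inequality forces to be non-negative.

```python
import math

def satisfies_hypotheses(S, tol=1e-12):
    """Non-negative entries, row sums 1 and column sums k/n, as in the lemma."""
    k, n = len(S), len(S[0])
    nonneg = all(s >= 0 for row in S for s in row)
    rows_ok = all(abs(sum(row) - 1) < tol for row in S)
    cols_ok = all(abs(sum(S[i][j] for i in range(k)) - k / n) < tol
                  for j in range(n))
    return nonneg and rows_ok and cols_ok

def jensen_gap(S, ys, f):
    """(1/n) sum f(y_j) - (1/k) sum f(x_i), where x = S y."""
    k, n = len(S), len(ys)
    xs = [sum(S[i][j] * ys[j] for j in range(n)) for i in range(k)]
    return sum(f(y) for y in ys) / n - sum(f(x) for x in xs) / k

# Illustrative data (an assumption, not taken from the paper): k = 2, n = 3.
S_EXAMPLE = [[2 / 3, 1 / 3, 0.0],
             [0.0, 1 / 3, 2 / 3]]
YS_EXAMPLE = [1 + 2j, -3 + 0.5j, 0.25 - 4j]
CONVEX_SAMPLES = [abs, lambda z: abs(z) ** 2, lambda z: math.exp(z.real)]
```

Calling `jensen_gap(S_EXAMPLE, YS_EXAMPLE, f)` for each `f` in `CONVEX_SAMPLES` gives a non-negative number whenever `satisfies_hypotheses(S_EXAMPLE)` holds.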

The following result contains in particular a positive solution to the 
conjecture of de Bruijn and Springer [B]. 

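Theorem 4.6. The following inequality holds true for any convex function $f : \mathbb{C} \to \mathbb{R}$ and any $k$, $1 \le k < n$:

$$\frac{1}{\binom{n-1}{k}} \sum_{1 \le i_1 < \dots < i_k \le n-1} f\Big( \prod_{j=1}^{k} (\mu_{i_j} - \alpha) \Big) \le \frac{1}{\binom{n}{k}} \sum_{1 \le i_1 < \dots < i_k \le n} f\Big( \prod_{j=1}^{k} (\lambda_{i_j} - \alpha) \Big). \tag{4.9}$$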



Proof. The inequality immediately follows by combining Theorem 4.3 with Lemma 4.5. □
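For a cubic polynomial the critical points are available in closed form, so the inequality of Theorem 4.6 can be tested directly for $k = 1$ and $k = 2$. The sketch below is an illustration with hypothetical function names and arbitrarily chosen sample data; it compares the average of $f$ over products of $k$ shifted critical points with the same average over the shifted zeros.

```python
import cmath
from itertools import combinations
from math import comb, exp, prod

def critical_points_cubic(l1, l2, l3):
    """Roots of p'(z) for p(z) = (z - l1)(z - l2)(z - l3)."""
    e1 = l1 + l2 + l3
    e2 = l1 * l2 + l1 * l3 + l2 * l3
    # p'(z) = 3 z^2 - 2 e1 z + e2, solved by the quadratic formula.
    disc = cmath.sqrt(4 * e1 * e1 - 12 * e2)
    return (2 * e1 + disc) / 6, (2 * e1 - disc) / 6

def averaged(f, points, k, alpha=0):
    """Average of f over all products of k of the shifted points."""
    vals = [f(prod(p - alpha for p in c)) for c in combinations(points, k)]
    return sum(vals) / comb(len(points), k)

# Sample data (assumptions made for this illustration only).
LAMBDAS = (1 + 1j, -2 + 0.5j, 0.5 - 3j)
MUS = critical_points_cubic(*LAMBDAS)
CONVEX = [abs, lambda z: abs(z) ** 2, lambda z: exp(z.real)]
```

For each `f` in `CONVEX`, each `k` in `(1, 2)` and any shift `alpha`, the value `averaged(f, MUS, k, alpha)` should not exceed `averaged(f, LAMBDAS, k, alpha)` up to rounding.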

Remark 4.1. In the case $k = 1$ inequality (4.9) coincides with inequality (4.6), that is, with the de Bruijn-Springer conjecture jH].



Remark 4.2. According to the result of Sherman [32], the existence of a $k \times n$ matrix $S$ satisfying the hypothesis of Lemma 4.5 is actually equivalent to the validity of inequality (4.8) for each function $f \in CV(\mathbb{R}^m)$.

4.3. The Schoenberg conjecture. 

Now we are ready to prove the famous Schoenberg conjecture [33].

We will need two lemmas. The first one is known [23], but we present it with a proof for the reader's convenience.

Lemma 4.7. Any matrix $A = (a_{ij})_{i,j=1}^{n} \in M_n(\mathbb{C})$ with spectrum $\sigma(A) = \{\lambda_j\}_1^{n}$ satisfies the inequality

$$\sum_{j=1}^{n} |\lambda_j|^2 \le \|A\|_2^2 = \sum_{i,j=1}^{n} |a_{ij}|^2 \tag{4.10}$$

and the equality holds if and only if $A$ is normal.

Proof. The inequality (4.10) is known. It is clear that the equality holds true for a normal matrix.

Conversely, let $A$ satisfy the equality. By the Schur theorem $A$ is unitarily equivalent to an upper triangular matrix with the $\lambda_j$-s on the diagonal. Since $\|A\|_2$ is unitarily invariant, this matrix will be diagonal, that is, $A$ is normal. □
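Inequality (4.10) and its equality case are easy to observe on $2 \times 2$ matrices, where the eigenvalues come from the quadratic formula. The following sketch uses hypothetical helper names and hand-picked matrices; it returns the difference $\|A\|_2^2 - \sum_j |\lambda_j|^2$, which should be positive for a non-normal matrix and vanish (up to rounding) for a normal one.

```python
import cmath

def eig2(a, b, c, d):
    """Eigenvalues of [[a, b], [c, d]] via the quadratic formula."""
    tr, det = a + d, a * d - b * c
    disc = cmath.sqrt(tr * tr - 4 * det)
    return (tr + disc) / 2, (tr - disc) / 2

def schur_gap(a, b, c, d):
    """||A||_2^2 - sum |lambda_j|^2 for A = [[a, b], [c, d]]."""
    l1, l2 = eig2(a, b, c, d)
    frob = sum(abs(x) ** 2 for x in (a, b, c, d))
    return frob - abs(l1) ** 2 - abs(l2) ** 2
```

For the upper triangular matrix with rows $(1, 2)$ and $(0, 3)$ the gap is the squared modulus of the off-diagonal entry, while for the normal matrix with rows $(1, -2)$ and $(2, 1)$ it is zero.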

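Lemma 4.8. Let $\varepsilon = e^{2\pi i/n}$ and $U = n^{-1/2} (\varepsilon^{k(j-1)})_{k,j=1}^{n}$ (here $k$ is the row index and $j$ the column index). Let also $\sum_{j=1}^{n} \lambda_j = 0$. Define

$$r(z) := \sum_{j=1}^{n} \lambda_j z^{j-1}$$

and

$$A := U \operatorname{diag}(\lambda_j)_{j=1}^{n} U^* = \frac{1}{n} \big( r(\varepsilon^{k-j}) \big)_{k,j=1}^{n} =: (a_{kj})_{k,j=1}^{n}.$$

Then $A$ is a normal matrix with spectrum $\sigma(A) = \{\lambda_j\}_1^{n}$ and $\sigma(A_{n-1}) = \{\mu_j\}_1^{n-1}$. Moreover the following identity holds true:

$$n \|A_{n-1}\|_2^2 = (n-2) \|A\|_2^2.$$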



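Proof. The first statement follows from Corollary 3.2 since the last row of $U$ consists of $1/\sqrt{n}$. It remains to prove the last identity.

It is easy to see that $a_{jj} = 0$ and $a_{nj} = a_{n-k,j-k}$ for $k < j$ and $a_{nj} = a_{k-j,k}$ for $k > j$. Therefore the required identity takes the form

$$\sum_{j=1}^{n} |\lambda_j|^2 = n \sum_{j=1}^{n-1} |a_{nj}|^2.$$

But

$$\sum_{j=1}^{n-1} |a_{nj}|^2 = \frac{n-1}{n^2} \sum_{j=1}^{n} |\lambda_j|^2 + \frac{2}{n^2} \sum_{i<j} c_{ij}\, \Re(\lambda_i \overline{\lambda_j}),$$

where

$$c_{ij} := \sum_{k=1}^{n-1} \varepsilon^{k(i-j)} = -1$$

for all $1 \le i \ne j \le n$. But we have

$$2 \sum_{i<j} \Re(\lambda_i \overline{\lambda_j}) = -\sum_{j=1}^{n} |\lambda_j|^2.$$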

This completes the proof. □ 
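The identity of Lemma 4.8 can be checked numerically by building the circulant matrix entry by entry. The sketch below is an illustration with hypothetical function names; it uses the formula $a_{kj} = \tfrac{1}{n} r(\varepsilon^{k-j})$ and compares $n\|A_{n-1}\|_2^2$ with $(n-2)\|A\|_2^2$ for a spectrum with zero sum.

```python
import cmath

def circulant_from_spectrum(lams):
    """Entries a[k][j] = r(eps**(k - j)) / n with r(z) = sum_p lams[p] * z**p."""
    n = len(lams)
    eps = cmath.exp(2j * cmath.pi / n)
    return [[sum(l * eps ** ((k - j) * p) for p, l in enumerate(lams)) / n
             for j in range(n)] for k in range(n)]

def frob2(M, size):
    """Squared Frobenius norm of the leading size x size block of M."""
    return sum(abs(M[i][j]) ** 2 for i in range(size) for j in range(size))

def lemma_sides(lams):
    """Return (n * ||A_{n-1}||^2, (n - 2) * ||A||^2) for the circulant A."""
    n = len(lams)
    A = circulant_from_spectrum(lams)
    return n * frob2(A, n - 1), (n - 2) * frob2(A, n)
```

With `lams = [2 + 1j, -1 - 3j, -1 + 2j]`, whose sum is zero, both values returned by `lemma_sides` agree with $\sum_j |\lambda_j|^2 = 20$ up to rounding.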

The next lemma is due to Ky Fan and Pall [10].

Lemma 4.9. Let $A$ be a normal matrix such that its submatrix $A_{n-1}$ is also normal and $A \ne A_{n-1} \oplus a_{nn}$. Then all the eigenvalues of $A$ lie on the same line.

The following result has been conjectured by Schoenberg [33] (see also 0).

Theorem 4.10. Let $\sum_{j=1}^{n} \lambda_j = 0$. Then

$$n \sum_{j=1}^{n-1} |\mu_j|^2 \le (n-2) \sum_{j=1}^{n} |\lambda_j|^2$$

and the equality holds if and only if all the numbers $\lambda_j$ lie on the same line.

Proof. Combining Lemma 4.7 with Lemma 4.8 we get

$$n \sum_{j=1}^{n-1} |\mu_j|^2 \le n \|A_{n-1}\|_2^2 = (n-2) \|A\|_2^2 = (n-2) \sum_{j=1}^{n} |\lambda_j|^2. \tag{4.11}$$

Moreover, by Lemma 4.7 equality in (4.11) holds if and only if $A_{n-1}$ is normal. On the other hand, by Lemma 4.9 this is possible if and only if all the $\lambda_j$-s lie on the same line. □
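For $n = 3$ the statement of Theorem 4.10 can be verified by hand, which the following sketch reproduces numerically (the function name is hypothetical). If $\lambda_1 + \lambda_2 + \lambda_3 = 0$ then $p(z) = z^3 + e_2 z - \lambda_1\lambda_2\lambda_3$ with $e_2 = \sum_{i<j} \lambda_i \lambda_j$, so $p'(z) = 3z^2 + e_2$ and the inequality reduces to $|\sum_j \lambda_j^2| \le \sum_j |\lambda_j|^2$.

```python
import cmath

def schoenberg_sides_cubic(l1, l2):
    """Both sides of n*sum|mu|^2 <= (n-2)*sum|lambda|^2 for n = 3, l3 = -(l1 + l2)."""
    l3 = -(l1 + l2)
    e2 = l1 * l2 + l1 * l3 + l2 * l3
    # p'(z) = 3 z^2 + e2, so the critical points are mu and -mu.
    mu = cmath.sqrt(-e2 / 3)
    lhs = 3 * (abs(mu) ** 2 + abs(-mu) ** 2)
    rhs = (3 - 2) * sum(abs(l) ** 2 for l in (l1, l2, l3))
    return lhs, rhs
```

Collinear zeros such as $1, 2, -3$ give equality, while zeros such as $1, i, -1-i$ give a strict inequality.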



4.4. The Masson-Shapiro Polynomials. In [25] Gisli Masson and Boris Shapiro initiated the study of a class of differential operators $T_Q$ defined as follows: let $Q$ be a monic polynomial of degree $k$. Then $T_Q$ is defined via

$$T_Q : f \mapsto (Qf)^{(k)}.$$

They have shown that for each $m$ there exists a unique polynomial eigenfunction $p_m$ of $T_Q$ of degree $m$. Moreover,

$$T_Q p_m = \lambda_{m,k}\, p_m \tag{4.12}$$

and $\lambda_{m,k}$ depends only on $k$, $m$, namely

$$\lambda_{m,k} = (m+1)(m+2) \cdots (m+k).$$
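Since $T_Q$ maps $z^j$ to $\lambda_{j,k} z^j$ plus terms of lower degree, the eigenpolynomial can be computed by back substitution on the coefficients. The sketch below is one possible way to do this in exact rational arithmetic; the function names, the normalization of $p_m$ as monic and the coefficient-list representation (lowest degree first) are assumptions made for the illustration, and $k \ge 1$ is required.

```python
from fractions import Fraction
from math import prod

def poly_mul(p, q):
    """Product of two polynomials given as coefficient lists."""
    out = [Fraction(0)] * (len(p) + len(q) - 1)
    for i, a in enumerate(p):
        for j, b in enumerate(q):
            out[i + j] += a * b
    return out

def poly_diff(p, times):
    """Derivative of order `times` of a coefficient list."""
    for _ in range(times):
        p = [i * c for i, c in enumerate(p)][1:]
    return p

def T(Q, f):
    """T_Q f = (Q f)^(k) with k = deg Q."""
    return poly_diff(poly_mul(Q, f), len(Q) - 1)

def eigenpolynomial(Q, m):
    """Monic degree-m eigenpolynomial of T_Q and its eigenvalue."""
    k = len(Q) - 1
    lam = Fraction(prod(range(m + 1, m + k + 1)))
    p = [Fraction(0)] * m + [Fraction(1)]
    for d in range(m - 1, -1, -1):
        # Contribution of the already known higher coefficients to z^d.
        residual = T(Q, p)[d]
        lam_d = Fraction(prod(range(d + 1, d + k + 1)))
        p[d] = residual / (lam - lam_d)
    return p, lam
```

For $Q(z) = z^2 - 1$ and $m = 2$, that is `eigenpolynomial([-1, 0, 1], 2)`, this gives $p_2(z) = z^2 - \tfrac{1}{5}$ with eigenvalue $12 = 3 \cdot 4$.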

One of the results of [25] is the following interesting analog of the Gauss-Lucas Theorem:

Theorem 4.11. The zeros of $p_m$ are contained in the convex hull of the set of zeros of $Q$ for each $m$.

The authors have also made a number of beautiful conjectures about the asymptotic distribution of the zeros of the $p_m$-s, recently proved

in m 

We strengthen Theorem 4.11 in the following way:

Theorem 4.12. Let $\{z_j\}_{j=1}^{k}$ be the zeros of $Q$ and $\{w_j\}_1^{m}$ the zeros of $p_m$. Then

(i) there exists an $m \times k$ matrix $S$ with $\sum_{j=1}^{k} s_{ij} = 1$ for all $i \in \{1, \dots, m\}$ and $\sum_{i=1}^{m} s_{ij} = m/k$ for all $j \in \{1, \dots, k\}$ and such that

$$\{w_j\}_1^{m} = S \{z_j\}_1^{k}, \tag{4.13}$$

(ii) for any convex function $f : \mathbb{C} \to \mathbb{R}$:

$$\frac{1}{m} \sum_{i=1}^{m} f(w_i) \le \frac{1}{k} \sum_{j=1}^{k} f(z_j). \tag{4.14}$$

Proof. Applying Theorem 4.3 $k$ times to the polynomial $Qp_m$ and its consecutive derivatives we arrive at the representation

$$\{w_j\}_1^{m} = S' \big( \operatorname{col}(\{w_j\}_1^{m}, \{z_j\}_1^{k}) \big) \tag{4.15}$$

with an $m \times (m+k)$ matrix $S' = (s'_{ij})$ being a product of the $k$ corresponding matrices and satisfying

$$\sum_{j=1}^{m+k} s'_{ij} = 1, \quad i \in \{1, \dots, m\}, \qquad \text{and} \qquad \sum_{i=1}^{m} s'_{ij} = \frac{m}{m+k}, \quad j \in \{1, \dots, m+k\}.$$

Now applying Lemma 4.5 we arrive at the inequality

$$\frac{1}{m} \sum_{i=1}^{m} f(w_i) \le \frac{1}{m+k} \Big( \sum_{i=1}^{k} f(z_i) + \sum_{i=1}^{m} f(w_i) \Big),$$

which yields (4.14).

It is not difficult to construct the matrix $S$ satisfying (4.13) and the other required properties by resolving the last identity for the $w_j$-s in (4.15). But its existence is immediately implied by (4.14) due to the result of Sherman [32] (see Theorem 2.1). □
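The bookkeeping of row and column sums under the product of $k$ matrices can be illustrated as follows. This is only a sketch: `step_matrix` is a hypothetical stand-in with the same row sums ($1$) and column sums ($(n-1)/n$) as one step of the argument, not the matrix produced by Theorem 4.3. Multiplying the stand-ins of sizes $m \times (m+1), \dots, (m+k-1) \times (m+k)$ gives an $m \times (m+k)$ matrix with row sums $1$ and column sums $m/(m+k)$, because the column-sum factors telescope.

```python
from fractions import Fraction

def step_matrix(n):
    """An (n-1) x n non-negative matrix with row sums 1 and column sums (n-1)/n."""
    M = [[Fraction(0)] * n for _ in range(n - 1)]
    for i in range(n - 1):
        M[i][i] = Fraction(n - 1 - i, n)
        M[i][i + 1] = Fraction(i + 1, n)
    return M

def mat_mul(A, B):
    """Plain matrix product of nested lists."""
    return [[sum(A[i][t] * B[t][j] for t in range(len(B)))
             for j in range(len(B[0]))] for i in range(len(A))]

def chain(m, k):
    """Product of k step matrices; the result has size m x (m + k)."""
    S = step_matrix(m + 1)
    for n in range(m + 2, m + k + 1):
        S = mat_mul(S, step_matrix(n))
    return S

def row_sums(S):
    return [sum(row) for row in S]

def col_sums(S):
    return [sum(S[i][j] for i in range(len(S))) for j in range(len(S[0]))]
```

For example, `chain(3, 2)` is a $3 \times 5$ matrix whose row sums are all $1$ and whose column sums are all $3/5$.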

Remark 4.3. In the case of real numbers $w_j$ and $z_j$ inequality (4.14) has been mentioned (without proof) by Harold Shapiro [30].

Final remarks. 1) Let $A \in M_n(\mathbb{C})$ be a normal matrix, $e \in \mathbb{C}^n$ a vector and $P$ the orthoprojection onto the orthogonal complement of $e$. It is not difficult to construct examples of a nondiagonalizable $A_e := PA\restriction_{P\mathbb{C}^n}$. It would be interesting to investigate the Jordan structure and other similarity (or unitary) invariants of the operator $A_e$.

2) We do not know whether the relations (3.9) are also sufficient.

3) The most famous unsolved conjectures connected with the Gauss-Lucas Theorem are the conjectures of Sendov and Smale (see [31] for a survey on this topic).

4) The relative location of the zeros of $p(z)$, $p'(z)$ and $p''(z)$ may be very nontrivial even in the case of real roots [29].

5) Some very interesting relations between the zeros of polynomials, 
their derivatives and majorization are studied in a recent paper 0] 
by J. Borcea and B. Shapiro. They formulate many interesting open
problems.